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ABSTRACT 

The ribosome's interactions with mRNA govern its 
translation rate and the effects of post-transcrip- 
tional regulation. Long, structured 5' untranslated 
regions (5' UTRs) are commonly found in bacterial 
mRNAs, though the physical mechanisms that 
determine how the ribosome binds these upstream 
regions remain poorly defined. Here, we systemat- 
ically investigate the ribosome's interactions with 
structured standby sites, upstream of Shine- 
Dalgarno sequences, and show that these inter- 
actions can modulate translation initiation rates by 
over 100-fold. We find that an mRNA's translation 
initiation rate is controlled by the amount of 
single-stranded surface area, the partial unfolding 
of RNA structures to minimize the ribosome's 
binding free energy penalty, the absence of 
cooperative binding and the potential for ribosomal 
sliding. We develop a biophysical model employing 
thermodynamic first principles and a four- 
parameter free energy model to accurately predict 
the ribosome's translation initiation rates for 136 
synthetic 5' UTRs with large structures, diverse 
shapes and multiple standby site modules. The 
model predicts and experiments confirm that the 
ribosome can readily bind distant standby site 
modules that support high translation rates, 
providing a physical mechanism for observed 
context effects and long-range post-transcriptional 
regulation. 

INTRODUCTION 

Long 5' untranslated regions (5' UTRs) are commonly 
found in bacterial mRNAs, and their interactions with 
the ribosome play a central role in controlling translation 
initiation and post-transcriptional regulation (1-3). 



Changes in the 5' UTR sequence, structure or overall 
shape have been found to alter an mRNA's translation 
rate, including when conformational changes are triggered 
by the binding of proteins, small RNAs or when ra-acting 
riboswitches bind their ligand (4-8). However, our under- 
standing of the physical mechanisms that determines the 
ribosome's ability to bind long 5' UTRs remains incom- 
plete. While the ribosome's interactions with Shine- 
Dalgarno (SD) sequences and nearby inhibitory mRNA 
structures have been extensively characterized (9-12) and 
are well-predicted by biophysical models (13-15), many 
natural 5' UTRs possess further upstream sequences and 
structures whose interactions with the ribosome are still 
poorly defined (Figure 1A). Here, we use a learn-by-design 
approach to systematically investigate the ribosome's 
interactions with long, structured 5' UTRs, and to 
identify new physical mechanisms that determine the ribo- 
some's ability to bind and initiate translation. 

According to crystal and cryo-EM structures, the 30S 
ribosomal complex appears to bind upstream 5' UTRs 
through its platform domain, which is a positively 
charged surface that binds non-specifically to mRNA's 
sugar phosphate backbone (3,16-18). The ribosomal 
platform first binds an initial landing pad, known as a 
'standby site' (19) (Supplementary Figure SI), followed 
by coordinated insertion of the mRNA into the Entry 
channel and the 16S rRNA's base pairing with SD like 
sequences to position the start codon over the P-site 
(18). Ribosomal SI protein likely plays a role in remodel- 
ing mRNA structure, prior to the formation of the SOS- 
initiation complex, by binding to single-stranded RNA 
and promoting the unfolding of RNA hairpins and 
duplexes; however, free energy must be inputted to 
unfold RNA structures (20). The mRNA and ribosomal 
platform bind together independently of other partici- 
pants, including tRNA^ 6 ' and the initiation factors 
(1,21). Interactions that delay binding to the 5' UTR can 
slow the mRNA's translation initiation rate, including the 
presence of mRNA structures (17,22), competitive binding 
by small RNAs (23) or the absence of a standby site 
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Figure 1. (A) Natural E. coli 5' UTRs have diverse lengths and struc- 
tures with varying binding free energy penalties to the ribosomal 
platform. Green and light blue regions represent SD and start codon, 
respectively. (B) 5' UTRs are separated into multiple standby site 
modules, followed by a SD sequence, spacer region and a protein 
coding sequence. (C) A schematic shows the mRNA regions that 
contact the ribosome in its initial and final states. The ribosomal 
platform binds to 5' UTRs with a binding free energy penalty 
AG s i an dby The sum of all binding free energies, AG lola i, determines 
how likely the ribosome binds to an mRNA and initiates translation. 



altogether (22). We consider a 5' UTR as composed of one 
or more standby site modules, followed by a 16S rRNA 
binding site (a SD sequence), a spacer region and ended 
with a protein coding sequence (Figure IB). An mRNA 
that folds into several hairpins, all located upstream of the 
SD sequence, will contain several standby site modules. 

Thermodynamic modeling has enabled a comprehensive 
dissection of the ribosome's interactions with mRNA with 
clearly defined contributions, quantitatively described by 
their free energies (Figure 1C). The most stringent test of 
our current understanding is to combine thermodynamic 
modeling with in silico optimization to design completely 
novel, non-natural RNA sequences with predicted riboso- 
mal interactions and translation initiation rates. The dif- 
ferences between model predictions and experimental 
measurements objectively draw the boundaries between 
our current understanding of RNA interactions and its 
limitations. A growing body of evidence, from our work 
and others, indicates the presence of unpredicted inter- 
actions between the ribosome and long 5' UTRs, high- 
lighted as context effects (Supplementary Figure SIC) 
(14,24-28). 

In this article, we decipher the biophysical rules govern- 
ing the ribosome's interactions with long, structured 
5' UTRs. We systematically alter 5' UTR sequence, struc- 
ture and shape to measure the ribosome's interactions 
using in vivo reporter assays. We use thermodynamic 
modeling to quantify how the ribosome binds less access- 
ible 5' UTRs, accommodates or partially unfolds RNA 
structure, selects from multiple binding sites, slides 
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across the mRNA and initiates translation. We develop 
a predictive biophysical model that combines thermo- 
dynamic minimization with a four-parameter free energy 
model to accurately predict the translation initiation rates 
of 136 long, diversely structured 5' UTRs. Finally, we 
demonstrate that the ribosome can bind to distant sites 
and engage in long-range RNA interactions, providing a 
mechanism for the observed context effects and new ways 
for riboswitches and small RNAs to regulate translation. 

MATERIALS AND METHODS 

Strains, media and plasmid construction 

Luria-Bertani (LB) media (10g/l tryptone, 5g/l yeast 
extract, 10g/l NaCl) was obtained from BD and supple- 
mented with 50ug/ml chloramphenicol (Sigma-Aldrich). 
The expression system is derived from the pFTVl 
plasmid (ColEl, Cm R ) (14) and encodes a ct 70 constitutive 
promoter (BioBrick #J23100), a synthetic 5' UTR and a 
codon-optimized mRFPl fluorescent protein. Synthetic 
5' UTRs were inserted by standard cloning techniques, 
utilizing either BamHI and SacI, or Xbal and SacI sites, 
depending on 5' UTR length. 5' UTRs with desired 
sequences were created by either annealing of complemen- 
tary oligonucleotides with overhangs, or by PCR assembly 
of oligonucleotides with corresponding restriction sites. 
Plasmid variants were cloned by restriction digest of 
DNA backbone, purification and ligation to DNA frag- 
ments with 5' UTR sequences and transformation of 
chemically competent Escherichia coli DH10B cells via 
heat shock. Correct clones were identified by DNA 
sequencing. All the designed 5' UTR sequences are 
presented in the Supplementary Data. 

Fluorescent protein and mRNA measurements 

Fluorescent protein reporter measurements were per- 
formed in 96-well format. An initial 96 deep-well plate 
containing 700 ul LB and 50ug/ml chloramphenicol was 
inoculated, from single colonies, with up to 30 different 
E. coli DH10B cultures in an alternating pattern that 
excluded the outer wells. Cultures were incubated over- 
night at 37°C with 200 rpm orbital shaking. A fresh 
96-well plate containing 200 til LB and chloramphenicol 
media was inoculated by overnight cultures using a 1:100 
dilution. Plates were incubated at 37°C in a plate-based 
spectrophotometer (TECAN M1000) with high orbital 
shaking. OD600 measurements were recorded every 
lOmin. Once a culture reached an OD600 of ~0. 15, a 
sample of each culture was transferred to a new plate con- 
taining 200 ul phosphate buffered saline [NaCl 137mM, 
KC1 2.7 mM, Phosphate buffer 10 mM (Na 2 HP0 4 / 
KH 2 P0 4 pH7.4), purchased from OmniPur] and 2mg/ml 
kanamycin (Sigma Aldrich) for flow cytometry measure- 
ments. Periodic serial dilutions were repeated two more 
times, maintaining cultures within the exponential 
growth phase. Three samples were taken from each 
culture. The single-cell fluorescence distributions of 
samples were measured using a Fortessa flow cytometer 
(BD Biosciences). All distributions were unimodal. The 
autofluorescence distribution of E. coli DH10B cells was 
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also measured. The arithmetic mean of each distribution 
was taken, and the mean autofluorescence was subtracted 
from each sample. The reported fluorescence values in this 
study are the average of three measurements {n = 3). 

For mRNA level measurements, single colonies were 
first grown overnight in 5 ml LB media with chloram- 
phenicol, then diluted to 0.01 OD600 in fresh media and 
grown to mid-exponential phase until reaching 1.0 
OD600, as measured by a cuvette-based spectrophotom- 
eter (NanoDrop 2000C). Total RNA was extracted using 
the Total RNA Purification kit (Norgen Biotek). 
Treatment with Turbo DNAse (Ambion) eliminated 
genomic DNA contamination. Reverse transcription was 
carried out with a High Capacity cDNA Reverse 
Transcription kit (Applied Biosystems), followed by quan- 
titative PCR with a TaqMan probe (5'-ACCTTCCATAC 
GAACTTT-3'), targeting the internal coding sequence for 
mRFPl (forward primer 5' - ACGTT ATC AAAG AGTTC 
ATGCGTTTC-3' and reverse primer 5'-CGATTTCGAA 
CTCGTGACCGTTAA-3') and using a ABI 7300 real- 
time cycler. A TaqMan probe for 16S rRNA was used 
as an endogenous control. Triplicate measurements were 
each performed on two separate RNA extractions. AAC t 
calculations were then averaged for each sample. 

Biophysical model calculations 

The complete biophysical model employs a free energy 
model to calculate the total binding free energy between 
the ribosome and mRNA, according to the equation 
(Figure 1C): 

AG to tal = AGmRNA-rRNA+AGspacing+AGsiart 
+ AG st andby — AG m RNA- 

Using statistical thermodynamics, we may relate this total 
Gibbs free energy change to the mRNA's translation ini- 
tiation rate, r, according to: 

rocexp^AGWi). (2) 

This relationship has been previously validated on 132 
mRNA sequences where the AG to tai varied from —10 to 
16kcal/mol, resulting in well-predicted translation rates 
that varied by over 100000-fold (14). The apparent 
Boltzmann constant, fi, has been measured as 
0.45 ± 0.05mol/kcal, which was confirmed in a second 
study (29). 

In the initial state, the mRNA exists in a structured 
conformation, where its free energy of folding is 
AG mRN A (AG mRN A is negative). After assembly of the 
30S ribosomal subunit, the last nine nucleotides of its 
16S rRNA have hybridized to the mRNA, while all non- 
clashing mRNA structures are allowed to fold. The free 
energy of folding for this mRNA-rRNA complex is 
A G m RNA-rRNA (AG mRNA -rRNA is negative). mRNA struc- 
tures that impede 16S rRNA hybridization or overlap with 
the ribosome footprint remain unfolded in the final state. 
These Gibbs free energies are calculated using a semi- 
empirical free energy model of RNA and RNA-RNA 
interactions (30,31) and the minimization algorithms 
available in the Vienna RNA suite version 1.8.5 (32). 



Three additional interactions will alter the translation 
initiation rate. The tRNA fMET anti-codon loop hybridizes 
to the start codon (AG start is most negative for AUG and 
GUG). The 30S ribosomal subunit also prefers a five 
nucleotide aligned spacing (33). Non-optimal distances 
between the 16S rRNA binding site and the start codon 
lead to an energetic binding penalty. This relationship 
between the ribosome's binding penalty ( AG spac i ng > 0) 
and aligned spacing distance was measured in our 
previous study (14). Finally, the 5' UTR binds to the ribo- 
somal platform with a free energy penalty AG sta ndby, 
which is calculated according to Equation (5) and 
described in the main text. A maximally accessible 
standby site module will bind to the ribosomal platform 
with a zero AG stan db y - Gibbs free energy calculations for 
all mRNA sequences are listed in the Supplementary 
Data. 

The biophysical model automatically decomposes a 
5' UTR into one or more standby site modules using the 
following definitions. A standby site module contains a 
single mRNA hairpin, surrounded by two single- 
stranded regions: an upstream distal binding site and a 
downstream proximal binding site. The lengths of the 
distal and proximal binding sites are denoted by D and 
P, respectively. The hairpin's height, H, is determined by 
the number of nucleotides from the hairpin's base to the 
mid-point of the hairpin's loop, including bulges, internal 
loops and base pairs. The heights of branched hairpins are 
determined by averaging the branches' heights 
(Supplementary Figure S3). Hairpins are only considered 
as components of standby site modules if they are 
mutually exclusive with the 16S rRNA binding site. 
A 5' UTR with multiple upstream hairpins will contain 
multiple standby site modules. We consider a 5' UTR 
without upstream hairpins to contain a single standby 
site module with zero hairpin height and zero distal site 
length. Using these definitions, the biophysical model cal- 
culates a AG stan dby for eac h standby site module. 

The available single-stranded surface area, A s , of a 
standby site module is determined according to 
A s = 15 + P + D - H. The key aspect of this equation is 
that proximal length, distal length and hairpin height have 
equal and interchangeable effects on the amount of single- 
stranded surface area. The constant 15 was selected to 
create a positive metric for single-stranded surface area. 
Based on data shown in Figure 2, increasing the hairpin 
height beyond 15 nt did not reduce the mRNA translation 
rate, regardless of the proximal or distal site lengths. 
Accordingly, the value of H in the equation is not 
allowed to exceed 15nt. 

The ribosomal platform's distortion energy penalty is a 
quadratic function of the available RNA surface area, A s , 
(Figure 2), according to: 

AG dislortion = C 1 (A S ) 2 +C 2 (A S )+C 3 , (3) 

where C\, C 2 and C 3 are the fitted parameter values and 
equal to 0.038 kcal/mol/nt 2 , - 1.629 kcal/mol/nt and 
17.359 kcal/mol, respectively. An alternative constant in 
the definition of surface area would alter these coefficients, 
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Figure 2. (A) Standby site modules are denned by a proximal and distal binding site, separated by an RNA structure. (B) Standby site module 
surface area is calculated by summing the proximal P and distal D binding site lengths, and subtracting the hairpin height H. The surface area is kept 
positive by adding a constant 15. (C-E) Translation rate and ribosome binding free energy penalties are measured as proximal binding site length, 
distal binding site length and hairpin height are increased. The AG slan db y numbers shown in secondary v-axis are directly related to the data 
according to Equation (4). (C) Hairpin heights are 9nt (green stars), 12 nt (black asterisks), 15 nt (red squares) or 18 nt (blue circles). (D) 
Hairpin heights are 15 nt (red squares) or 18 nt (blue circles). (E) Hairpin height varies from 9 to 12 nt by adding to loop length of a 6 bp stem 
(blue circles) or from 14 to 18 nt by adding to loop length of a 12 bp stem (red squares). (F) 56 characterized 5' UTRs (^4 S <24) from parts (C-E) are 
combined to quantitatively relate the standby site module's surface area to the ribosome's translation rate and binding free energy penalty. Dashed 
line is a best-fit quadratic equation. (G) The validity of the quadratic equation is tested by measuring translation rates from 15 additional 5' UTRs 
and comparing to model predictions (R 2 = 0.78, average error is 0.66kcal/mol). Data points are the result of three measurements in 1 day. In parts 
(C-G), the horizontal dashed line is the translation rate and ribosome binding free energy penalty of the reference unstructured 5' UTR. 



but the overall relationship between the surface area and 
the distortion free energy penalty would remain the same. 

Measurements of translation rate and binding free 
energy penalties 

In our steady-state, long-time growth conditions, transla- 
tion initiation is a rate-limiting step in codon-optimized 
mRFPl fluorescent protein expression; fluorescence levels 
are proportional to translation initiation rates under our 
controlled growth conditions (13), and translation initi- 
ation rate is related to total binding free energies accord- 
ing to a log-linear formula (Equation 2). We measure the 
change in apparent binding free energy penalty by 



comparing the fluorescence of a structured 5' UTR with 
the fluorescence of a maximally accessible 5' UTR, accord- 
ing to: 

AG s tandby = ~J Xn \Y~ \ ~ ( AAG oter), (4) 

where X is the fluorescence from a sample synthetic 
5' UTR, X re{ is the fluorescence of the unstructured 
5' UTR, and p is 0.45mol/kcal. A AG 

other is the difference 
in free energies, between the sample and reference 
5' UTRs, that depends on other ribosomal interactions 
and not on the standby site. By using identical and care- 
fully designed 16S rRNA binding sites, spacer regions and 



2650 Nucleic Acids Research, 2014, Vol. 42, No. 4 



coding sequences in the 136 synthetic mRNA sequences, 
the value for A AG ot h er was zero for 123 mRNA sequences, 
and a maximum of 0.90kcal/mol for the remaining 
sequences. 

Statistical analysis 

Two-sample T- test, at two-tailed 5% statistical signifi- 
cance, was performed to compare the error distribution 
of different sets of model predictions. The null hypothesis 
assumed that the errors in two different sets of model 
predictions have same mean and variance. The alternative 
hypothesis indicates that the errors in two sets follow dif- 
ferent distributions. The error in model prediction, 
AAG s t a ndby was calculated according to the formula: 
AAG standby = measured AG standby - predicted AG standby . 
The low probabilities (P < 0.05) reject the null hypothesis, 
showing that the differences in two sets of model predic- 
tions were statistically significant. 

RESULTS 

Standby site module surface area controls the ribosome's 
translation rate 

We first employed a learn-by-design approach to deter- 
mine the ribosomal platform's interactions with standby 
sites that contain a single hairpin, and therefore contain a 
single standby site module. Standby site modules are 
defined by their hairpin height (H), upstream distal 
binding site length (D) and downstream proximal 
binding site length (P) (Figure 2A; 'Materials and 
Methods' section). We designed, constructed and 
characterized 62 synthetic 5' UTRs containing single 
standby site modules with hairpins from 9 to 18 bp long, 
distal binding sites from 1 to 15nt long and proximal 
binding sites from 1 to 24 nt long. Long mRNA hairpins 
contain internal loops to avoid potential RNAse activity 
(34). All mRNA sequences are transcribed by the J23100 
constitutive promoter, encode the mRFPl fluorescent 
protein, and are designed with identical SD and spacer 
sequences ('Materials and Methods' section). Flow 
cytometry measurements from long-time, log-phase 
cultures with periodic serial dilutions revealed that 
mRFPl fluorescence varied by 221 -fold (Figure 2C-E). 
In contrast, mRNA levels varied by at most 5.8-fold, 
according to RT-qPCR measurements (Supplementary 
Figure S2). Changes in fluorescence are primarily due to 
changes in translation rate and, in these growth condi- 
tions, fluorescence and translation initiation rates are pro- 
portional. All sequences and measurement data are listed 
in the Supplementary Data. 

According to the data, standby site module accessibility 
plays an important role in modulating the ribosome's 
interactions with 5' UTRs and controlling translation 
rate across a large range. In particular, lengthening the 
distal binding site by four nucleotides increased the 
mRNA's translation rate by 4.2-fold, whereas lengthening 
the proximal binding site by the same amount increased it 
by 4.0-fold. Shortening the hairpin's height by 4 bp 
increased the mRNA's translation rate similarly by 3.1- 
fold. More generally, the slopes (first derivatives) of the 



log-transformed translation rates all have similar magni- 
tudes (Supplementary Data, column E). Highly accessible 
standby site modules with long proximal binding sites or 
short hairpins also cause the mRNA's translation rate to 
reach a plateau with the same fluorescence level as an 
mRNA with an unstructured 5' UTR (Figure 2C). These 
observations led to the hypothesis that the geometric char- 
acteristics of the standby site module (D, P, H) are inter- 
changeable, and that its available single-stranded surface 
area, A s , determines the ribosomal platform's ability to 
bind to it. 

A standby site module's single-stranded surface area is 
the number of nucleotides capable of single-stranded 
contact with the ribosomal platform. Longer proximal 
or distal binding sites increase the standby site module's 
surface area, while longer hairpin structures reduce the 
standby site module's accessibility. mRNA structures 
can potentially reduce site accessibility by folding over 
the neighboring single-stranded sites and sequestering 
binding regions. A standby site module's available 
surface area is calculated by summing its proximal and 
distal binding site lengths and subtracting the hairpin's 
height (Figure 2B; 'Materials and Methods' section). A 
highly accessible standby site module has a large A s , 
whereas a less accessible standby site module has a small 
A s . We then evaluated whether the standby site module's 
surface area is the key determinant by comparing A s to the 
measured log-transformed translation rates. For all 62 
5' UTRs with diverse geometric features, we found a 
single quantitative relationship that described the 
effect of surface area on the mRNA's translation rate 
(Figure 2F). 

We now turn to thermodynamic modeling to quantify 
the ribosome's interactions with standby site modules. The 
competitive binding of free ribosomes to the pool of intra- 
cellular mRNAs allows one to use statistical thermo- 
dynamics to relate a ribosome's binding free energy to 
its translation initiation rate, according to a constant 
log-linear relationship (Equation 2) (13,14). We calculate 
the interaction energy between the ribosome and an 
mRNA's standby site modules by comparing its transla- 
tion to the translation rate of a reference, unstructured 
5' UTR, both with an identical SD sequence, spacer 
region and protein coding sequence (Equation 4). Using 
the data in Figure 2C-E, we found that the standby site 
binding free energy penalty, AG stan dby, varied from 0 to 
11.8kcal/mol when changing its geometric characteristics, 
and that a simple quadratic function succinctly described 
the effect of site accessibility on the ribosome's binding 
free energy penalty (R 2 = 0.97, average error is 
0.51 kcal/mol) (Figure 2F). A maximally accessible 
standby site module has a zero binding free energy 
penalty (AG stan db y = 0), indicating that the mRNA's 
translation initiation rate is not reduced by the ribosomal 
platform's interactions. A standby site module with a 
positive binding free energy penalty (AG stan db y > 0) has 
reduced the mRNA's translation initiation rate by 
decreasing the ribosomal platform's ability to initially 
bind the mRNA. 

Next, we critically tested the parameterized model by 
constructing 15 new 5' UTRs and measuring their 
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translation initiation rates and ribosome binding free 
energy penalties. The 5' UTRs each contain a standby 
site module with diverse structures, between 43 and 91 nt 
long, including variable-length distal and proximal 
binding sites, hairpin heights, and including two- and 
three-branched structures (Supplementary Data). For 
branched structures, we show that the relevant hairpin 
height is the average across the multiple branches 
(Supplementary Figure S3) (see also Supplementary 
Text). The quadratic relationship accurately predicted 
the ribosome's translation rate and binding free energy 
penalty (R z = 0.78, average error is 0.66kcal/mol), con- 
firming that standby site module's accessibility controls 
the ribosome's ability to bind and initiate translation 
(Figure 2G). The quadratic relationship contains three 
parameters, which are now treated as constants 
(Equation 3; 'Materials and Methods' section). 



Coupled energetic trade-offs when unfolding structures in 
standby site modules 

Prior to initiating translation, the 30S ribosome does not 
have an external source of energy. Initiation factor 2 does 
not hydrolyze GTP until after translation has been 
initiated, during 50S recruitment (22). Consequently, the 
30S complex expends its own free energy to unfold mRNA 
structures. Previous studies have shown that mRNA struc- 
tures that sequester the 16S rRNA binding site are 
unfolded prior to initiating translation (9,14,22). The 
mRNA's translation initiation rate is reduced according 
to the free energy penalty for unfolding these structures, as 
described in Equations (1) and (2) ('Materials and 
Methods' section). However, it remains unclear whether 
mRNA structures located upstream of SD sequences, 
within standby site modules, are unfolded or 
accommodated. We next investigated how the ribosome 
unfolds hairpins in long 5' UTRs. 

By definition, there is a coupled thermodynamic rela- 
tionship between the unfolding of an mRNA hairpin and 
increasing the accessibility of the corresponding standby 
site module, where the accessibility of standby site module 
controls the ribosomal platform's distortion energy 
penalty AGdistortion ( see 'Materials and Methods' 
section). Unfolding a single base pair in a module's 
hairpin will increase the standby site module's surface 
area by three nucleotides, the hairpin's height decreases 
by one and the lengths of both the proximal and distal 
binding sites increase by one. Therefore, unfolding a 
module's hairpin will lead to higher accessibility, a lower 
AGdistortion an d a higher translation rate. However, un- 
folding base pairs requires an input of free energy 
(AG un f 0 idi n g > 0), which will lead to a lower translation 
rate. Consequently, there is an energetic trade-off 
between unfolding mRNA structures, and increasing the 
surface accessibility of standby site modules. The 
ribosome may completely unfold a structured standby 
site module to maximize its surface area, but the energetic 
cost may be high (high AG unfo i ding penalty with 
AG distortion = 0). In contrast, the ribosome may fully ac- 
commodate a standby site module's hairpin, without 



unfolding it, even though its low surface accessibility will 
penalize the ribosome's ability to bind to it (high 
AG distortion penalty with AG unfolding = 0). 

Based on thermodynamic principles, we hypothesize 
that the ribosome will 'selectively' and 'partially' unfold 
structured standby site modules to minimize its total 
binding free energy penalty (AG standby = AG distortion + 
AG un foi d i ng ). Figure 3A and B presents an illustrative 
example of this principle and our use of minimization to 
calculate the ribosome's binding free energy penalty to a 
standby site module. A 5' UTR with a single standby site 
module initially contains a short proximal binding site 
(P = 2 nt), a short distal binding site (D = 2 nt) and a 
long hairpin (H = 15 nt), resulting in a largely inaccessible 
5' UTR (A s = 4). Based on the standby site module's 
surface area, the ribosome has an initial distortion 
energy penalty of 1 1.5 kcal/mol. However, the ribosome 
can selectively unfold the hairpin's closing A:U base 
pairs, increasing the standby site module's surface area 
and lowering its AGdistortion at the expense of increasing 
its AG U nfoi d ing- Unfolding 1 bp increases the standby site 
module's surface area by 3 nt, decreases the distortion 
energy penalty to 7.8 kcal/mol and increases the unfolding 
energy penalty to 0.90 kcal/mol, leading to a total binding 
free energy penalty of 8.7 kcal/mol. The ribosome can 
continue to unfold base pairs to minimize its total 
binding free energy penalty; unfolding 3 bp at a cost of 
2.7 kcal/mol leads to a distortion energy penalty 
of 2.6 kcal/mol, and a total binding free energy penalty 
of 5.3 kcal/mol (Figure 3B). Two additional examples 
are shown in Supplementary Figure S4. The unfolding 
free energies are calculated using the Vienna RNA suite 
with the Turner 1999 RNA parameters (see 'Materials and 
Methods' section). Importantly, thermodynamic mini- 
mization does not add any new parameters to the model. 

We tested the proposed mechanism by characterizing the 
translation rate of 22 synthetic 5' UTRs (Figure 3C-E) that 
contain single standby site modules with varied geometries 
(D = 0 to 7 nt and P = 1 to 20 nt) and differing hairpin 
sequences, sizes and folding energetics (Supplementary 
Figure S5A). We employed the minimization principle to 
calculate the number of unfolded base pairs and to predict 
the ribosome's binding free energy penalty, AG stan dby 
(Figure 3D). As comparisons, we performed the same cal- 
culation while asserting that either no additional energy 
was inputted to unfold hairpins (high AGdistortion an d 
AGunfoiding = 0) (Figure 3C) or that hairpins were com- 
pletely unfolded by the ribosome (high AG un foiding an d 
AGdistortion = 0) (Figure 3E). 

The data supports our hypothesis that the ribosome 
partially and selectively unfolds structured standby site 
modules to minimize its total binding free energy penalty 
(AGstandby). rather than separately minimize its distortion 
or unfolding penalties (average error is 0.90 kcal/mol when 
minimizing AGstandby, 2.42 kcal/mol when AG un f 0 idi ng = 0 
and 10.24 kcal/mol when AGdistortion = 0; comparative 
P-values are 0.004 and 10 ~ 13 , respectively). 
Remarkably, in all 22 cases, only 1-3 bp of each hairpin 
are predicted by the model to be unfolded, while 
accommodating the remaining large RNA structures. 
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Non-cooperative binding to multiple standby site modules 

We have shown that the ribosome can accommodate large 
structures of standby site modules and engage in frequent 
translation, but it is unclear how the presence of multiple 
standby site modules in a 5' UTR will affect its translation 
rate. The presence of multiple standby site modules could 
allow the ribosome to cooperatively bind mRNA and 
increase its translation rate. We next characterized nine 
synthetic 5' UTRs (76-164 nt long) with up to four 
standby site modules to investigate the potential for co- 
operative binding (Figure 4A). If the ribosome can engage 
in cooperative binding, then the additional standby site 
modules will increase the overall available surface area, 
reduce the ribosome's binding free energy penalty and 
increase the translation initiation rate. In particular, we 
expect the greatest cooperativity when the available 
surface area of a single standby site module is limited. 

We found that 5' UTRs with two, three or four 
standby site modules were all translated at the same 
rate, indicating an absence of cooperative binding 



(Figure 4B), including when the available surface areas 
for individual standby site modules were significantly 
lowered. Therefore, even when the combined surface 
areas of multiple standby site modules could potentially 
increase contact with the ribosomal platform and increase 
binding, a cooperative increase in translation rate was not 
observed. Consequently, the data supports a mechanism 
whereby the ribosome binds to only a single standby site 
module prior to initiating translation. 

Distant standby site modules can support high 
translation rates 

We next investigated whether any standby site module 
could be bound by the ribosome, even if located far 
upstream of the start codon. Using the model's predic- 
tions, we forward-engineered multi-standby site module 
synthetic 5' UTRs where the most distantly upstream 
site is the only one capable of supporting high translation 
rates (Figure 4C). Internal standby site modules have long 
hairpins and limited surface areas (model predictions: 
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(blue bars) of 5' UTRs M[ to M 4 , compared to an unstructured 5' UTR (see also Supplementary Figure S2). (E) Fluorescence measurements of 5' 
UTRs Mj to M 4 show that upstream standby site modules can support high translation rates. (F) Model predictions are shown for 5' UTRs M[ to M4 
using either AG dislorlion (gray bars) or AGj islor , ion + AG^ing (blue bars) for each standby site module (I-IV, corresponding to labels in part C), 
compared to the ribosome's apparent binding free energy penalties (dashed lines, yellow region) (average error with and without the sliding penalty is 
1.15 and 3.24kcal/mol, respectively; P-value is 0.013). AGdistortion for each standby site module was calculated according to Equation (3). In parts B 
and E, the horizontal dashed line is the translation rate and ribosome binding free energy penalty of the unstructured 5' UTR. 



AG s t an dby between 9.0 and 17.4kcal/mol) compared to the 
maximally accessible upstream standby site modules. 
Therefore, if the ribosome binds the farthest upstream 
standby site module, its translation rate will be at least 
12.7-fold higher than binding to the internal standby site 
modules. We measured fluorescence levels with flow 
cytometry and mRNA levels with RT-qPCR. Compared 
to an unstructured 5' UTR, the mRNA levels decreased by 
only 1.3- to 5.5-fold, while the fluorescence levels were 
correspondingly reduced between 2.5- and 18.3-fold, 
showing that changes in fluorescence were mainly due to 
changes in translation rate and ribosome binding free 
energy penalties (Figure 4D). 

We found that all the mRNAs' translation rates 
remained very high, even when the accessible standby 
site module was located over 100 nt upstream of the 
start codon (M4 5' UTR) (Figure 4E). In particular, the 
M2 5' UTR is translated at a 33.8-fold higher rate when 
compared to the translation rate that could be supported 
by binding to the standby site module closest to the start 
codon. Moving the accessible standby site module 
upstream, away from the start codon, does not signifi- 
cantly reduce the translation rate. Based on these meas- 
urements, the apparent standby site binding penalties 
are 3.6, 6.2 and 6.5kcal/mol for M2 to M4 5' UTRs 



(Figure 4F), respectively, which is much lower than the 
internal standby site modules' predicted binding free 
energy penalties (9.0-1 7.4 kcal/mol). These measurements 
suggest that the ribosome will bind to the standby site 
module that has the lowest binding free energy penalty, 
regardless of its distance from the start codon. 

The ribosomal platform can slide across 
upstream structures 

Several mechanisms would allow the ribosomal platform 
to bind distant, upstream standby site modules and tran- 
sition to a pre-initiation complex at the start codon. After 
the ribosome has bound to the upstream standby site 
module, the 30S complex could engage in diffusive 
hopping that would increase its local concentration and 
assembly rate at the start codon. The rate of assembly 
would largely depend on the physical distance between 
the standby site module and start codon, and not on the 
presence of RNA structures. Alternatively, the ribosome 
could employ a full contact, sliding mechanism that reori- 
ents the mRNA until a stable pre-initiation complex has 
formed. The positively charged surface of the ribosomal 
platform makes it ideal for maintaining contact with a 
5' UTR until 16S rRNA hybridization has anchored the 
mRNA. Unlike a diffusive hopping mechanism, the 
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presence of mRNA structures would impede this sliding 
motion and reduce the pre-initiation complex's assembly 
rate. 

Further analysis of our measurements shows that RNA 
structures in between the upstream standby site module 
and start codon partly attenuate the mRNA's translation 
rate (Figure 4E). The apparent binding free energy penalty 
of the M2 5' UTR (3.6kcal/mol) is higher than the 
upstream standby site module's predicted binding free 
energy penalty (0.4kcal/mol). This energetic difference 
cannot be explained by binding to the internal standby 
site module (11.4kcal/mol), or by the unfolding of the 
downstream RNA structure (17.5 kcal/mol). Thus, the 
free energy difference (3.6 - 0.4 = 3.2 kcal/mol) is signifi- 
cant, but much lower than predicted by alternative 
binding to other standby site modules or from unfolding 
of RNA structures. 

Our data supports the presence of a sliding mechanism 
for the ribosomal platform. Intervening mRNA structures 
are not unfolded, but are pushed aside with an energetic 
cost. Using our measurements of the M2 5' UTR, we 
estimate that 3.2 ± 0.5 kcal/mol free energy is required 
to push aside the single RNA structure with a height of 
15 nt, yielding a coefficient of 0.20 ± 0.04 kcal/mol per 
hairpin height. We then determined whether including 
this sliding penalty could improve the model's ability to 
predict the translation rate of long 5' UTRs with multiple 
standby site modules with 'hairpins of differing heights'. 
For each standby site module, we calculate the heights of 
all downstream RNA structures, and multiplied the sum 
by the coefficient (c = 0.2 kcal/mol/nt) to yield a free 
energy penalty of sliding, A Gliding- The sliding free 
energy penalties were 0, 3.0, 4.8 and 7.8 kcal/mol for Ml 
to M4 5' UTRs, respectively. We add together the sliding, 
distortion and unfolding free energy penalties to calculate 
the ribosome's binding free energy penalty to each standby 
site module. 

Figure 4E shows the measured translation initiation 
rates for Ml to M4, alongside the predicted binding free 
energy penalties (Figure 4F), both with and without the 
sliding free energy penalty. The data shown in Figure 4E 
and F are on the same scale, according to the log-linear 
relationship between translation initiation rate and 
binding free energy differences ('Materials and Methods' 
section), enabling a comparison between the measure- 
ments and the two predictions. The introduction of the 
sliding free energy penalty enabled the biophysical model 
to more accurately predict the translation rate from longer 
5' UTRs with several standby site modules (average error 
is 1.86 kcal/mol with the sliding energy penalty, compared 
to 4.45 kcal/mol without the sliding energy penalty; 
P-value is 0.013). 

A biophysical model predicts translation rates from 
diversely structured 5' UTRs 

Altogether, the ribosome binds to 5' UTRs by selecting 
the most geometrically accessible standby site module that 
requires the least amount of RNA unfolding. Standby site 
module selection is not limited by distance to the start 
codon, and upstream standby site modules can support 



high rates of translation. We identified that ribosomal dis- 
tortion, selective RNA unfolding and ribosomal sliding 
control standby site module selection and binding ener- 
getics, and combine them together to create a single bio- 
physical model. The ribosome's binding free energy 
penalty to the standby site modules in a structured 
5' UTR is calculated according to 

AG s t an dby = AG c ij slort j on + AG un f 0 idj n g+ A Gliding (5) 

All three Gibbs free energies are positive, and a long, un- 
structured 5' UTR would bind to the ribosomal platform 
without an energetic penalty (AG stan db y = 0 kcal/mol). All 
RNA energetics are calculated using a semi-empirical free 
energy model of RNA and RNA-RNA interactions 
(30,31) and the minimization algorithms available in the 
Vienna RNA suite version 1.8.5 (32). This model extends 
our previous biophysical model that focused on down- 
stream mRNA interactions, including the 16S rRNA 
binding site, spacer region and downstream RNA struc- 
tures (13,14). 

We further tested the accuracy of the biophysical model 
by characterizing an additional 28 diversely structured 
5' UTRs (68-1 64 nt long) with two to four standby site 
modules of varying geometries, RNA structure energetics 
and hairpin heights (Supplementary Figure S5B). The bio- 
physical model was able to accurately predict the relative 
translation rates and ribosome binding free energy 
penalties (average error is 0.79 kcal/mol and R 2 = 0.83) 
(Figure 5A). Overall, the biophysical model contains 
four, empirically determined parameter values, but can 
accurately predict the binding free energy penalties and 
translation rates of the 136 synthetic 5' UTRs 
characterized in this study (average error is 0.75 kcal/mol 
and R 2 = 0.89) (Figure 5B). 

Genome-wide predictions of 5' UTR translation rates 

We next employed the biophysical model to examine how 
standby sites in genomic 5' UTRs control their translation 
initiation rates. From the annotated E. coll MG1655 
genome, provided by EcoCyc (35), we collected 3430 
5' UTRs that varied from short leaders to gene-sized regu- 
latory hot spots with average 5' UTR length of 100 nt. The 
5' UTR sequences from the +1 of the mRNA transcript to 
+ 100 after the protein coding sequence's start codon were 
inputted into the biophysical model. For individual 
5' UTRs, a list of standby site modules was automatically 
generated. For each standby site module, the surface area 
A s , distortion energy penalty AGdistortion, unfolding energy 
penalty AG unfo i din g and sliding energy penalty AG s | iding 
were calculated. These calculations are combined to deter- 
mine the AG s t an dby for each standby site module. Because 
of the absence of cooperative binding, as shown, the 
standby site module with the lowest binding free energy 
penalty is selected as the one controlling the ribosome's 
translation initiation rate. 

According to the model calculations, we found that the 
lengths of the natural 5' UTRs do not correlate with their 
ability to bind the ribosomal platform (Figure 6A). Short 
5' UTRs with low accessibility can inhibit translation 
equally well, compared to long 5' UTRs where 
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accessibility is limited by RNA structures. The model cal- 
culations indicate that the ribosomal platform can bind to 
natural 5' UTRs with a wide range of energetic penalties 
(up to 17.4 kcal/mol), repressing translation by up to 2500- 
fold (Figure 6B). Long 5' UTRs with low AG standby can be 
subjected to translation repression when trans-acting 
small RNAs bind and reduce their single-stranded acces- 
sibility (23,29,36-38), or when m-acting riboswitches 
reduce accessibility when bound to a chemical ligand 
(39—43). In one example, the standby site in the tisB 
5' UTR is occluded when bound by the IstR-1 small 
RNA, and its translation is repressed (23). In contrast, 
long 5' UTRs with high AG standby can be targets for 



translation activation when structural remodeling by 
small RNAs or riboswitches increases their surface acces- 
sibility. RprA small RNA has been shown to bind and up- 
regulate the translation of rpoS mRNA with the help of 
Hfq-protein (44). 

The model predicts that both types of translation regu- 
lation occur in natural 5' UTRs. In particular, 39% of 
genomic 5' UTRs have standby site modules whose 
distal or proximal binding sites are longer than 15 nt, 
which are potent targets for translational repression; in 
contrast, 9% of 5' UTRs contain standby site modules 
with low accessibilities (^4 S < 10), making them targets 
for translation activation. Moreover, 79 diverse 5' UTRs 
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with an average length of 175 nt have positive sliding 
energy penalties (AG s iidi ng ), indicating that distant 
standby site modules are controlling translation, and 
making them potential targets for long-range regulation. 

Further, there are 769 operons where variability in their 
transcriptional start sites, due to adjacent or overlapping 
promoters, result in 5' UTRs with different lengths and 
mRNA structures. For example, the mmuPM operon has 
five different transcriptional start sites, leading to 5' UTR 
isoforms with lengths from 9 to 205 nt. The biophysical 
model predicts a 1088-fold change in the translation initi- 
ation rate of mmuP, as a result of changes in the structure 
and surface area of the different standby site modules. 
Overall, across all 5' UTR isoforms within the genome, 
model calculations show large changes in translation ini- 
tiation rates through alterations in the standby site 
modules' characteristics (Figure 6C). These differences 
imply that switching promoter usage, due to changing 
transcription factor or sigma factor levels, can modulate 
an mRNA's translation rate, and potentially create 
standby site modules whose accessibility can be addition- 
ally regulated by translation factors. However, further 
investigation is necessary to examine the 5' UTRs of 
isoforms and the regulation of their translation rates. 

Biophysical modeling to calculate translation rates from 
split ribosome binding sites 

Sacerdot et al. (45) studied the translation of the 5' UTR 
of E. coli thrS mRNA that uses a split ribosome binding 
site. The biophysical model classifies the 5' UTR as con- 
taining three standby site modules with large RNA struc- 
tures and varying distal and proximal binding site lengths 
(Figure 7). The authors concluded that a 24 nt long single 
stranded region (domain 3) is essential for ribosome 
binding and that the ribosome can accommodate a long 



hairpin (domain 2) without unfolding it, which was later 
supported by crystallographic data (3). The biophysical 
model's calculations are strongly consistent with their con- 
clusions. In particular, a mutation that deleted the entire 
domain 3 and its upstream region (ILOA4) repressed the 
translation rate by over 50-fold, where it significantly 
reduced the standby site accessibility (from A s = 24 and 
AG distortion = 0 to A s = 0 and AG distortion = 17.4 kcal/ 
mol). To offset this large energetic burden, model calcu- 
lations indicate that the ribosomal platform partially 
unfolded 6 bp from the domain 2 hairpin, which 
minimized its binding free energy penalty to a AG stan dby 
of 5.32kcal/mol (A s = 20.5, AGdistortion = 0.02 and 
AG un foiding = 5.3 kcal/mol). Destabilization of the 
domain 2 hairpin (ILOA5) restored the mRNA's transla- 
tion to its wild-type rate by increasing the surface area of 
the standby site to 20.5 nt, resulting in an almost zero 
AG s t an dby The biophysical model calculations indicate 
that other mutations to the thrS 5' UTR (L6, Ml, Nl, 
BS4-9, BS4-9/CS29, L19, L7, CS30A2, ILOA1 and 
ILOA2; see Figure 7) did not alter its standby site 
module surface area or RNA structure; the maximum 
observed change in translation was 2.5-fold for these 
mutations. 

DISCUSSION 

We designed and characterized synthetic mRNA se- 
quences and applied biophysical modeling to decipher 
the rules for the ribosome's interactions at structured 
5' UTRs. The 30S ribosome searches for a single 
standby site module that supports the highest translation 
rate, regardless of its distance from the start codon. The 
ribosome's binding free energy penalty to individual 
standby site modules is governed by a competing trade- 
off between the accessibility of standby site modules 



Nucleic Acids Research, 2014, Vol. 42, No. 4 2657 



(Figure 2) and the unfolding of RNA structures to 
increase accessibility (Figure 3). We demonstrate that 
thermodynamic minimization predicts the extent of 
the ribosome's selective and partial unfolding of RNA 
structures. Our data supports a sliding mechanism 
whereby downstream RNA structures are not unfolded, 
but attenuate translation rate in a height-dependent 
manner (Figure 4). 

Importantly, initially designed sequences have well- 
defined features that perturb the ribosome's individual 
interactions with mRNA, allowing us to precisely 
measure their effect on the ribosome's binding free 
energy penalty. The forward design of synthetic sequences, 
according to model predictions, quantitatively tests our 
knowledge of these interactions and the model's ability 
to predict their strengths. Overall, the biophysical model 
employs first-principles calculations with four empirically 
measured constants to accurately predict the translation 
initiation rates of 136 mRNAs with diversely structured, 
long 5' UTRs (Figure 5). 

Based on our findings, there is also the potential for 
long-range regulation of translation. Long, structured 
5' UTRs can feature several upstream standby site 
modules extending over lOOnt away from an open 
reading frame. Any of these standby site modules are 
potential binding sites for the ribosome, according to 
their binding free energy penalties, but only a single 
standby site module is ultimately bound. Both m-acting 
and ;ra77.y-acting factors can modulate individual standby 
site module surface accessibilities to control standby site 
module selection and the resulting translation rate 
(3,17,23). Importantly, the regulation of standby site 
module selection will produce discrete step changes in 
translation rate that can further regulate downstream 
processes according to non-Boolean, multi-state logic. 

In this study, we introduce a new metric to quantify the 
surface area of a standby site module, which was validated 
using diverse, but canonical, RNA secondary structures. 
The metric relates the geometric characteristics of the 
standby site module (P, D, H) to its available single- 
stranded surface area, implicitly using coefficients of one 
to indicate interchangeability. Additional measurements 
will be needed to precisely measure these coefficients, 
and relate them to the helical shape of A-form RNA. In 
addition, tertiary structures, such as G-quadruplexes and 
pseudoknots, will likely have a differently defined metric 
of surface area. Further investigation will be necessary to 
determine the effect of tertiary RNA structures on standby 
site accessibility, ribosome binding and translation 
initiation. 

As suggested by the biophysical model, the ribosomal 
platform's ability to bind standby site modules with low 
surface area relies on its structural flexibility. Previously 
published cryo-EM data shows that the platform domain 
in the 30S subunit, in both the free and 50S-bound states, 
can occupy variable conformational states (46). The two 
halves of the platform, which are stabilized by ribosomal 
proteins at its surface, switch conformations during trans- 
lation (47). Restraining this flexibility will reduce the ribo- 
some's translation initiation rate; for example, when the 
antibiotic Edeine binds 16S ribosomal RNA helices H23 



and H24 (48). Consequently, the ribosomal platform's 
structural flexibility appears to be an essential aspect of 
translating mRNAs with structured 5' UTRs. 

Similarly, the rigidity of single-stranded RNA likely 
plays an important role in modulating the ribosome's dis- 
tortion penalty, due to the entropic differences of binding 
flexible or rigid macromolecules (49). Similar to DNA, 
polyA RNA sequences are less flexible than mixed or 
polyU RNA sequences (50,51). Differences in RNA 
rigidity have been shown to alter the ribosome's binding 
affinity to spacer regions separating the SD and start 
codon sequences (52); binding free energy differences are 
~2.5kcal/mol between polyA and polyAU spacer se- 
quences, and ~5kcal/mol between polyA and polyU 
spacer sequences. Incorporating a sequence-dependent 
model for RNA rigidity into the distortion penalty calcu- 
lations could potentially increase their accuracy. 

Analysis of the model's error can lead to further insight 
into the ribosome's interactions. The model's predictions 
have a normally distributed error for 90% of sequences 
(Supplementary Figure S6). Thirteen outliers have errors 
between 2 and 4kcal/mol (Supplementary Data). Some 
outliers are due to the effects of increased mRNA degrad- 
ation of 5' UTRs with very long proximal or distal binding 
sites (P, D>20nt) (Supplementary Figure S2). In others, 
extremely short proximal or distal binding sites can result 
in non-canonical base pairings that alter the ribosome's 
binding affinity. For example, in the absence of a 
proximal binding site (P = 0 nt), co-axial stacking will 
take place between the nucleotides in the distal binding 
site and the mRNA-rRNA duplex that anchors the 
ribosome to the mRNA. Similarly, co-axial stacking will 
play a larger role in predicting the effects of small RNAs 
that bind to sites nearby RNA structures. 

As the list of interactions that control gene expression 
continues to grow, we will stretch the limits of human 
pattern recognition and the use of observational correl- 
ations. Biophysical modeling offers a comprehensive 
approach to account for all known interactions with the 
gene expression machinery, identify gaps, design experi- 
ments, precisely measure strengths and make testable pre- 
dictions. The accuracy of these models will go hand-in-hand 
with our ability to engineer genetic systems without trial- 
and-error. We have incorporated the ribosome's inter- 
actions with structured standby site modules into an 
improved biophysical model, called the RBS Calculator 
v2.0. A software implementation and user-friendly web 
interface is available at http://salis.psu.edu/software. 
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Supplementary Data are available at NAR Online. 
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